DOA-informed source extraction in the presence of competing talkers and background noise
نویسندگان
چکیده
A desired speech signal in hands-free communication systems is often degraded by noise and interfering speech. Even though the number and locations of the interferers are often unknown in practice, it is justified to assume in certain applications that the direction-of-arrival (DOA) of the desired source is approximately known. Using the known DOA, fixed spatial filters such as the delay-and-sum beamformer can be steered to extract the desired source. However, it is well-known that fixed data-independent spatial filters do not provide sufficient reduction of directional interferers. Instead, the DOA information can be used to estimate the statistics of the desired and the undesired signals and to compute optimal data-dependent spatial filters. One way the DOA is exploited for optimal spatial filtering in the literature, is by designing DOA-based narrowband detectors to determine whether a desired or an undesired signal is dominant at each time-frequency (TF) bin. Subsequently, the statistics of the desired and the undesired signals can be estimated during the TF bins where the respective signal is dominant. In a similar manner, a Gaussian signal model-based detector which does not incorporate DOA information has been used in scenarios where the undesired signal consists of stationary background noise. However, when the undesired signal is non-stationary, resulting for example from interfering speakers, such a Gaussian signal model-based detector is unable to robustly distinguish desired from undesired speech. To this end, we propose a DOA model-based detector to determine the dominant source at each TF bin and estimate the desired and undesired signal statistics. We demonstrate that data-dependent spatial filters that use the statistics estimated by the proposed framework achieve very good undesired signal reduction, even when using only three microphones.
منابع مشابه
A Novel DOA Estimation Approach for Unknown Coherent Source Groups with Coherent Signals
In this paper, a new combination of Minimum Description Length (MDL) or Eigenvalue Gradient Method (EGM), Joint Approximate Diagonalization of Eigenmatrices (JADE) and Modified Forward-Backward Linear Prediction (MFBLP) algorithms is proposed which determines the number of non-coherent source groups and estimates the Direction Of Arrivals (DOAs) of coherent signals in each group. First, the MDL...
متن کاملSpectral and temporal changes to speech produced in the presence of energetic and informational maskers.
Talkers change the way they speak in noisy conditions. For energetic maskers, speech production changes are relatively well-understood, but less is known about how informational maskers such as competing speech affect speech production. The current study examines the effect of energetic and informational maskers on speech production by talkers speaking alone or in pairs. Talkers produced speech...
متن کاملSpeech production modifications produced by competing talkers, babble, and stationary noise.
Noise has an effect on speech production. Stationary noise and babble have been used in the past but the effect of a competing talker, which might be expected to cause different types of disruption, has rarely been investigated. The current study examined the acoustic and phonetic consequences of N-talker noise on sentence production for a range of values of N from 1 (competing talker) to infin...
متن کاملAcoustic correlated sources direction finding in the presence of unknown spatial correlation noise
In this paper, a new method is proposed for DOA estimation of correlated acoustic signals, in the presence of unknown spatial correlation noise. By generating a matrix from the signal subspace with the Hankel-SVD method, the correlated resource information is extracted from each eigen-vector. Then a joint-diagonalization structure is constructed of the signal subspace and basis it, independent...
متن کاملStrategies adopted by talkers faced with fluctuating and competing-speech maskers.
Studying how interlocutors exchange information efficiently during conversations in less-than-ideal acoustic conditions promises to both further the understanding of links between perception and production and inform the design of human-computer dialogue systems. The current study explored how interlocutors' speech changes in the presence of fluctuating noise. Pairs of talkers were recorded whi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- EURASIP J. Adv. Sig. Proc.
دوره 2017 شماره
صفحات -
تاریخ انتشار 2017